Between Sound and Spelling: Combining Phonetics and Clustering Algorithms to Improve Target Word Recovery

نویسندگان

  • Marcos Zampieri
  • Renato Cordeiro de Amorim
چکیده

In this paper we revisit the task of spell checking focusing on target word recovery. We propose a new approach that relies on phonetic information to improve the accuracy of clustering algorithms in identifying misspellings and generating accurate suggestions. The use of phonetic information is not new to the task of spell checking and it was used successfully in previous approaches. The combination of phonetics and cluster-based methods for spell checking was to our knowledge not yet explored and it is the new contribution of our work. We report an improvement of 8.16% accuracy when compared to a previously proposed spell checking approach.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fuzzy Clustering Approach Using Data Fusion Theory and its Application To Automatic Isolated Word Recognition

 In this paper, utilization of clustering algorithms for data fusion in decision level is proposed. The results of automatic isolated word recognition, which are derived from speech spectrograph and Linear Predictive Coding (LPC) analysis, are combined with each other by using fuzzy clustering algorithms, especially fuzzy k-means and fuzzy vector quantization. Experimental results show that the...

متن کامل

ارائه یک الگوریتم خوشه بندی برای داده های دسته ای با ترکیب معیارها

Clustering is one of the main techniques in data mining. Clustering is a process that classifies data set into groups. In clustering, the data in a cluster are the closest to each other and the data in two different clusters have the most difference. Clustering algorithms are divided into two categories according to the type of data: Clustering algorithms for numerical data and clustering algor...

متن کامل

Integrating Speech with Keypad Inpu of Spelling and Pronunciatio

This paper describes research whose ultimate aim is to support automatic entry of new words into a spoken dialogue system through interaction with a user. This research demonstrates an important step towards this goal, through a procedure which integrates information made available via the telephone keypad with a spoken instance of the target word, to produce a candidate spelling and pronunciat...

متن کامل

Cross-linguistic adaptations of The Comprehensive Aphasia Test: Challenges and solutions.

Comparative research on aphasia and aphasia rehabilitation is challenged by the lack of comparable assessment tools across different languages. In English, a large array of tools is available, while in most other languages, the selection is more limited. Importantly, assessment tools are often simple translations and do not take into consideration specific linguistic and psycholinguistic parame...

متن کامل

Linguistics Effects of Target - Word Frequency Rate on Sound - Meaning - Connection in Five to Fifteen Month

The purpose of this study was to examine the effects of manipulating target-word frequency rate and target-word phrase position on sound-meaning-connection in five to fifteen monthold Swedish infants. Three different test conditions, each one of them a film showing objects and corresponding phrases made of randomly generated artificial words, were designed. The structure of the first, high vari...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014